Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 827 |
| Missing cells | 834 |
| Missing cells (%) | 5.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 129.3 KiB |
| Average record size in memory | 160.2 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 9 |
trestbps is highly correlated with trestbpd | High correlation |
met is highly correlated with df_index and 2 other fields | High correlation |
thalach is highly correlated with met | High correlation |
trestbpd is highly correlated with trestbps and 1 other fields | High correlation |
oldpeak is highly correlated with exang and 1 other fields | High correlation |
num is highly correlated with oldpeak | High correlation |
df_index is highly correlated with restecg and 2 other fields | High correlation |
cp is highly correlated with exang | High correlation |
restecg is highly correlated with df_index | High correlation |
tpeakbpd is highly correlated with trestbpd | High correlation |
exang is highly correlated with cp and 1 other fields | High correlation |
dataset is highly correlated with df_index and 1 other fields | High correlation |
trestbps has 60 (7.3%) missing values | Missing |
htn has 33 (4.0%) missing values | Missing |
fbs has 66 (8.0%) missing values | Missing |
pro has 65 (7.9%) missing values | Missing |
met has 107 (12.9%) missing values | Missing |
thalach has 57 (6.9%) missing values | Missing |
thalrest has 58 (7.0%) missing values | Missing |
tpeakbps has 64 (7.7%) missing values | Missing |
tpeakbpd has 64 (7.7%) missing values | Missing |
trestbpd has 60 (7.3%) missing values | Missing |
exang has 57 (6.9%) missing values | Missing |
oldpeak has 60 (7.3%) missing values | Missing |
rldv5e has 71 (8.6%) missing values | Missing |
df_index has unique values | Unique |
oldpeak has 322 (38.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-10-17 20:35:48.484081 |
|---|---|
| Analysis finished | 2022-10-17 20:35:58.114598 |
| Duration | 9.63 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 827 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 430.9226119 |
| Minimum | 0 |
|---|---|
| Maximum | 900 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 41.3 |
| Q1 | 206.5 |
| median | 413 |
| Q3 | 620.5 |
| 95-th percentile | 858.7 |
| Maximum | 900 |
| Range | 900 |
| Interquartile range (IQR) | 414 |
Descriptive statistics
| Standard deviation | 263.2545032 |
|---|---|
| Coefficient of variation (CV) | 0.6109090031 |
| Kurtosis | -1.150818063 |
| Mean | 430.9226119 |
| Median Absolute Deviation (MAD) | 207 |
| Skewness | 0.1687656497 |
| Sum | 356373 |
| Variance | 69302.93347 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 555 | 1 | 0.1% |
| 545 | 1 | 0.1% |
| 546 | 1 | 0.1% |
| 547 | 1 | 0.1% |
| 548 | 1 | 0.1% |
| 549 | 1 | 0.1% |
| 550 | 1 | 0.1% |
| 551 | 1 | 0.1% |
| 552 | 1 | 0.1% |
| Other values (817) | 817 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 900 | 1 | |
| 899 | 1 | |
| 898 | 1 | |
| 897 | 1 | |
| 896 | 1 | |
| 895 | 1 | |
| 894 | 1 | |
| 893 | 1 | |
| 892 | 1 | |
| 891 | 1 |
age
Real number (ℝ≥0)
| Distinct | 49 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.30181818 |
| Minimum | 28 |
|---|---|
| Maximum | 77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 47 |
| median | 54 |
| Q3 | 60 |
| 95-th percentile | 68 |
| Maximum | 77 |
| Range | 49 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.489509235 |
|---|---|
| Coefficient of variation (CV) | 0.1780334998 |
| Kurtosis | -0.3877835298 |
| Mean | 53.30181818 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.1563824897 |
| Sum | 43974 |
| Variance | 90.05078553 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54 | 48 | 5.8% |
| 58 | 38 | 4.6% |
| 55 | 36 | 4.4% |
| 52 | 33 | 4.0% |
| 57 | 32 | 3.9% |
| 51 | 32 | 3.9% |
| 59 | 31 | 3.7% |
| 62 | 31 | 3.7% |
| 56 | 30 | 3.6% |
| 48 | 30 | 3.6% |
| Other values (39) | 484 |
| Value | Count | Frequency (%) |
| 28 | 1 | 0.1% |
| 29 | 3 | 0.4% |
| 30 | 1 | 0.1% |
| 31 | 2 | 0.2% |
| 32 | 5 | |
| 33 | 2 | 0.2% |
| 34 | 7 | |
| 35 | 9 | |
| 36 | 5 | |
| 37 | 11 |
| Value | Count | Frequency (%) |
| 77 | 2 | 0.2% |
| 76 | 2 | 0.2% |
| 75 | 3 | 0.4% |
| 74 | 7 | |
| 72 | 4 | 0.5% |
| 71 | 5 | 0.6% |
| 70 | 5 | 0.6% |
| 69 | 12 | |
| 68 | 8 | |
| 67 | 13 |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Memory size | 6.6 KiB |
| 1.0 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2475 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 643 | |
| 0.0 | 182 | 22.0% |
| (Missing) | 2 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 643 | |
| 0.0 | 182 | 22.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1007 | |
| . | 825 | |
| 1 | 643 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1650 | |
| Other Punctuation | 825 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1007 | |
| 1 | 643 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2475 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1007 | |
| . | 825 | |
| 1 | 643 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2475 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1007 | |
| . | 825 | |
| 1 | 643 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Memory size | 6.6 KiB |
| 4.0 | |
|---|---|
| 3.0 | |
| 2.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2475 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 2.0 |
| 4th row | 4.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 4.0 | 426 | |
| 3.0 | 190 | |
| 2.0 | 166 | 20.1% |
| 1.0 | 43 | 5.2% |
| (Missing) | 2 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 4.0 | 426 | |
| 3.0 | 190 | |
| 2.0 | 166 | 20.1% |
| 1.0 | 43 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 825 | |
| 0 | 825 | |
| 4 | 426 | |
| 3 | 190 | 7.7% |
| 2 | 166 | 6.7% |
| 1 | 43 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1650 | |
| Other Punctuation | 825 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 825 | |
| 4 | 426 | |
| 3 | 190 | 11.5% |
| 2 | 166 | 10.1% |
| 1 | 43 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2475 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 825 | |
| 0 | 825 | |
| 4 | 426 | |
| 3 | 190 | 7.7% |
| 2 | 166 | 6.7% |
| 1 | 43 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2475 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 825 | |
| 0 | 825 | |
| 4 | 426 | |
| 3 | 190 | 7.7% |
| 2 | 166 | 6.7% |
| 1 | 43 | 1.7% |
| Distinct | 58 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 60 |
| Missing (%) | 7.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 131.7014342 |
| Minimum | 80 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 80 |
|---|---|
| 5-th percentile | 105 |
| Q1 | 120 |
| median | 130 |
| Q3 | 140 |
| 95-th percentile | 160 |
| Maximum | 200 |
| Range | 120 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 18.25102838 |
|---|---|
| Coefficient of variation (CV) | 0.1385788128 |
| Kurtosis | 0.5614984919 |
| Mean | 131.7014342 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.5820909421 |
| Sum | 101015 |
| Variance | 333.1000371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 120 | 118 | |
| 130 | 106 | |
| 140 | 93 | 11.2% |
| 110 | 54 | 6.5% |
| 150 | 50 | 6.0% |
| 160 | 41 | 5.0% |
| 125 | 25 | 3.0% |
| 128 | 16 | 1.9% |
| 100 | 15 | 1.8% |
| 138 | 14 | 1.7% |
| Other values (48) | 235 | |
| (Missing) | 60 | 7.3% |
| Value | Count | Frequency (%) |
| 80 | 1 | 0.1% |
| 92 | 1 | 0.1% |
| 94 | 2 | 0.2% |
| 95 | 6 | 0.7% |
| 96 | 1 | 0.1% |
| 98 | 1 | 0.1% |
| 100 | 15 | |
| 101 | 1 | 0.1% |
| 102 | 3 | 0.4% |
| 104 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 200 | 2 | 0.2% |
| 192 | 1 | 0.1% |
| 190 | 2 | 0.2% |
| 180 | 11 | 1.3% |
| 178 | 3 | 0.4% |
| 174 | 1 | 0.1% |
| 172 | 2 | 0.2% |
| 170 | 12 | 1.5% |
| 165 | 1 | 0.1% |
| 160 | 41 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 33 |
| Missing (%) | 4.0% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2382 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 401 | |
| 1.0 | 393 | |
| (Missing) | 33 | 4.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 401 | |
| 1.0 | 393 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1195 | |
| . | 794 | |
| 1 | 393 | 16.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1588 | |
| Other Punctuation | 794 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1195 | |
| 1 | 393 | 24.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 794 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2382 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1195 | |
| . | 794 | |
| 1 | 393 | 16.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1195 | |
| . | 794 | |
| 1 | 393 | 16.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 66 |
| Missing (%) | 8.0% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2283 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 631 | |
| 1.0 | 130 | 15.7% |
| (Missing) | 66 | 8.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 631 | |
| 1.0 | 130 | 17.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1392 | |
| . | 761 | |
| 1 | 130 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1522 | |
| Other Punctuation | 761 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1392 | |
| 1 | 130 | 8.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 761 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2283 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1392 | |
| . | 761 | |
| 1 | 130 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2283 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1392 | |
| . | 761 | |
| 1 | 130 | 5.7% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 4 |
| Missing (%) | 0.5% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 2.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2469 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 486 | |
| 2.0 | 178 | 21.5% |
| 1.0 | 159 | 19.2% |
| (Missing) | 4 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 486 | |
| 2.0 | 178 | 21.6% |
| 1.0 | 159 | 19.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1309 | |
| . | 823 | |
| 2 | 178 | 7.2% |
| 1 | 159 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1646 | |
| Other Punctuation | 823 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1309 | |
| 2 | 178 | 10.8% |
| 1 | 159 | 9.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 823 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2469 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1309 | |
| . | 823 | |
| 2 | 178 | 7.2% |
| 1 | 159 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2469 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1309 | |
| . | 823 | |
| 2 | 178 | 7.2% |
| 1 | 159 | 6.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 65 |
| Missing (%) | 7.9% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2286 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 647 | |
| 1.0 | 115 | 13.9% |
| (Missing) | 65 | 7.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 647 | |
| 1.0 | 115 | 15.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1409 | |
| . | 762 | |
| 1 | 115 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1524 | |
| Other Punctuation | 762 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1409 | |
| 1 | 115 | 7.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 762 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2286 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1409 | |
| . | 762 | |
| 1 | 115 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2286 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1409 | |
| . | 762 | |
| 1 | 115 | 5.0% |
| Distinct | 28 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 107 |
| Missing (%) | 12.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3675 |
| Minimum | 2 |
|---|---|
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 13 |
| Maximum | 18 |
| Range | 16 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.097255167 |
|---|---|
| Coefficient of variation (CV) | 0.4203943219 |
| Kurtosis | 0.2787169346 |
| Mean | 7.3675 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.7197781468 |
| Sum | 5304.6 |
| Variance | 9.592989569 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 107 | |
| 5 | 106 | |
| 6 | 102 | |
| 9 | 71 | |
| 10 | 58 | |
| 4 | 54 | |
| 8 | 51 | |
| 3 | 31 | 3.7% |
| 13 | 27 | 3.3% |
| 12 | 23 | 2.8% |
| Other values (18) | 90 | |
| (Missing) | 107 |
| Value | Count | Frequency (%) |
| 2 | 22 | 2.7% |
| 2.5 | 1 | 0.1% |
| 3 | 31 | 3.7% |
| 3.5 | 1 | 0.1% |
| 4 | 54 | |
| 4.5 | 1 | 0.1% |
| 5 | 106 | |
| 5.4 | 1 | 0.1% |
| 5.8 | 1 | 0.1% |
| 6 | 102 |
| Value | Count | Frequency (%) |
| 18 | 2 | 0.2% |
| 17 | 2 | 0.2% |
| 16 | 7 | 0.8% |
| 15 | 3 | 0.4% |
| 14 | 20 | 2.4% |
| 13 | 27 | 3.3% |
| 12 | 23 | 2.8% |
| 11 | 19 | 2.3% |
| 10 | 58 | |
| 9 | 71 |
| Distinct | 111 |
|---|---|
| Distinct (%) | 14.4% |
| Missing | 57 |
| Missing (%) | 6.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 139.5376623 |
| Minimum | 69 |
|---|---|
| Maximum | 202 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 69 |
|---|---|
| 5-th percentile | 98 |
| Q1 | 120 |
| median | 140 |
| Q3 | 159.75 |
| 95-th percentile | 179 |
| Maximum | 202 |
| Range | 133 |
| Interquartile range (IQR) | 39.75 |
Descriptive statistics
| Standard deviation | 25.01259521 |
|---|---|
| Coefficient of variation (CV) | 0.1792533628 |
| Kurtosis | -0.5570110255 |
| Mean | 139.5376623 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.1806678604 |
| Sum | 107444 |
| Variance | 625.6299191 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 150 | 40 | 4.8% |
| 140 | 38 | 4.6% |
| 120 | 28 | 3.4% |
| 130 | 27 | 3.3% |
| 160 | 26 | 3.1% |
| 170 | 20 | 2.4% |
| 110 | 19 | 2.3% |
| 125 | 18 | 2.2% |
| 142 | 14 | 1.7% |
| 162 | 13 | 1.6% |
| Other values (101) | 527 | |
| (Missing) | 57 | 6.9% |
| Value | Count | Frequency (%) |
| 69 | 1 | 0.1% |
| 71 | 1 | 0.1% |
| 73 | 1 | 0.1% |
| 77 | 1 | 0.1% |
| 80 | 2 | |
| 82 | 3 | |
| 84 | 3 | |
| 86 | 3 | |
| 87 | 1 | 0.1% |
| 88 | 2 |
| Value | Count | Frequency (%) |
| 202 | 1 | 0.1% |
| 195 | 1 | 0.1% |
| 194 | 1 | 0.1% |
| 192 | 1 | 0.1% |
| 190 | 2 | |
| 188 | 2 | |
| 187 | 1 | 0.1% |
| 186 | 2 | |
| 185 | 4 | |
| 184 | 4 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 58 |
| Missing (%) | 7.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76.18335501 |
| Minimum | 37 |
|---|---|
| Maximum | 139 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 66 |
| median | 74 |
| Q3 | 85 |
| 95-th percentile | 100 |
| Maximum | 139 |
| Range | 102 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.62398265 |
|---|---|
| Coefficient of variation (CV) | 0.1919577137 |
| Kurtosis | 0.7982786077 |
| Mean | 76.18335501 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.6582926207 |
| Sum | 58585 |
| Variance | 213.8608684 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 70 | 36 | 4.4% |
| 80 | 30 | 3.6% |
| 74 | 30 | 3.6% |
| 68 | 27 | 3.3% |
| 75 | 26 | 3.1% |
| 84 | 25 | 3.0% |
| 73 | 24 | 2.9% |
| 64 | 23 | 2.8% |
| 78 | 23 | 2.8% |
| 72 | 22 | 2.7% |
| Other values (63) | 503 | |
| (Missing) | 58 | 7.0% |
| Value | Count | Frequency (%) |
| 37 | 1 | 0.1% |
| 40 | 1 | 0.1% |
| 43 | 1 | 0.1% |
| 46 | 1 | 0.1% |
| 47 | 1 | 0.1% |
| 49 | 4 | |
| 50 | 2 | 0.2% |
| 51 | 1 | 0.1% |
| 52 | 8 | |
| 53 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 139 | 1 | 0.1% |
| 134 | 1 | 0.1% |
| 125 | 3 | |
| 124 | 1 | 0.1% |
| 120 | 3 | |
| 119 | 1 | 0.1% |
| 116 | 1 | 0.1% |
| 115 | 2 | 0.2% |
| 112 | 2 | 0.2% |
| 110 | 5 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 64 |
| Missing (%) | 7.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 172.2961992 |
| Minimum | 84 |
|---|---|
| Maximum | 240 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 84 |
|---|---|
| 5-th percentile | 130 |
| Q1 | 156 |
| median | 170 |
| Q3 | 190 |
| 95-th percentile | 220 |
| Maximum | 240 |
| Range | 156 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 25.76185758 |
|---|---|
| Coefficient of variation (CV) | 0.1495207538 |
| Kurtosis | 0.1998375257 |
| Mean | 172.2961992 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 0.0428102665 |
| Sum | 131462 |
| Variance | 663.6733057 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 180 | 87 | 10.5% |
| 160 | 82 | 9.9% |
| 170 | 74 | 8.9% |
| 190 | 60 | 7.3% |
| 200 | 54 | 6.5% |
| 150 | 43 | 5.2% |
| 140 | 34 | 4.1% |
| 220 | 23 | 2.8% |
| 210 | 21 | 2.5% |
| 165 | 17 | 2.1% |
| Other values (63) | 268 | |
| (Missing) | 64 | 7.7% |
| Value | Count | Frequency (%) |
| 84 | 1 | 0.1% |
| 90 | 1 | 0.1% |
| 92 | 1 | 0.1% |
| 98 | 2 | 0.2% |
| 100 | 1 | 0.1% |
| 110 | 3 | |
| 112 | 1 | 0.1% |
| 116 | 1 | 0.1% |
| 120 | 6 | |
| 124 | 3 |
| Value | Count | Frequency (%) |
| 240 | 5 | 0.6% |
| 235 | 1 | 0.1% |
| 232 | 1 | 0.1% |
| 230 | 14 | |
| 228 | 1 | 0.1% |
| 224 | 1 | 0.1% |
| 220 | 23 | |
| 216 | 1 | 0.1% |
| 215 | 3 | 0.4% |
| 210 | 21 |
| Distinct | 51 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 64 |
| Missing (%) | 7.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.42070773 |
| Minimum | 11 |
|---|---|
| Maximum | 134 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 64 |
| Q1 | 80 |
| median | 90 |
| Q3 | 100 |
| 95-th percentile | 110 |
| Maximum | 134 |
| Range | 123 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.05719423 |
|---|---|
| Coefficient of variation (CV) | 0.1722383017 |
| Kurtosis | 0.8726135898 |
| Mean | 87.42070773 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.1655087739 |
| Sum | 66702 |
| Variance | 226.7190982 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 136 | |
| 90 | 120 | |
| 100 | 108 | |
| 70 | 52 | 6.3% |
| 110 | 40 | 4.8% |
| 95 | 24 | 2.9% |
| 75 | 23 | 2.8% |
| 60 | 22 | 2.7% |
| 78 | 22 | 2.7% |
| 84 | 16 | 1.9% |
| Other values (41) | 200 | |
| (Missing) | 64 | 7.7% |
| Value | Count | Frequency (%) |
| 11 | 1 | 0.1% |
| 26 | 1 | 0.1% |
| 40 | 2 | 0.2% |
| 45 | 1 | 0.1% |
| 50 | 2 | 0.2% |
| 55 | 1 | 0.1% |
| 56 | 2 | 0.2% |
| 58 | 3 | 0.4% |
| 60 | 22 | |
| 62 | 3 | 0.4% |
| Value | Count | Frequency (%) |
| 134 | 1 | 0.1% |
| 130 | 2 | 0.2% |
| 120 | 15 | 1.8% |
| 118 | 4 | 0.5% |
| 116 | 2 | 0.2% |
| 115 | 7 | 0.8% |
| 114 | 2 | 0.2% |
| 112 | 1 | 0.1% |
| 110 | 40 | |
| 108 | 2 | 0.2% |
| Distinct | 32 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 60 |
| Missing (%) | 7.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.54628422 |
| Minimum | 50 |
|---|---|
| Maximum | 110 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 70 |
| Q1 | 80 |
| median | 80 |
| Q3 | 90 |
| 95-th percentile | 100 |
| Maximum | 110 |
| Range | 60 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 9.639460546 |
|---|---|
| Coefficient of variation (CV) | 0.1153786866 |
| Kurtosis | -0.001920627627 |
| Mean | 83.54628422 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.004938102035 |
| Sum | 64080 |
| Variance | 92.91919962 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 241 | |
| 90 | 148 | |
| 70 | 82 | 9.9% |
| 100 | 59 | 7.1% |
| 85 | 27 | 3.3% |
| 78 | 24 | 2.9% |
| 95 | 21 | 2.5% |
| 82 | 15 | 1.8% |
| 84 | 14 | 1.7% |
| 75 | 14 | 1.7% |
| Other values (22) | 122 | |
| (Missing) | 60 | 7.3% |
| Value | Count | Frequency (%) |
| 50 | 2 | 0.2% |
| 58 | 1 | 0.1% |
| 60 | 10 | 1.2% |
| 64 | 4 | 0.5% |
| 65 | 4 | 0.5% |
| 66 | 1 | 0.1% |
| 68 | 4 | 0.5% |
| 70 | 82 | |
| 72 | 8 | 1.0% |
| 74 | 9 | 1.1% |
| Value | Count | Frequency (%) |
| 110 | 4 | 0.5% |
| 106 | 2 | 0.2% |
| 105 | 3 | 0.4% |
| 104 | 1 | 0.1% |
| 102 | 1 | 0.1% |
| 100 | 59 | |
| 98 | 12 | 1.5% |
| 96 | 7 | 0.8% |
| 95 | 21 | 2.5% |
| 94 | 12 | 1.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 57 |
| Missing (%) | 6.9% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2310 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 481 | |
| 1.0 | 289 | |
| (Missing) | 57 | 6.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 481 | |
| 1.0 | 289 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1251 | |
| . | 770 | |
| 1 | 289 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1540 | |
| Other Punctuation | 770 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1251 | |
| 1 | 289 | 18.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 770 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2310 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1251 | |
| . | 770 | |
| 1 | 289 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2310 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1251 | |
| . | 770 | |
| 1 | 289 | 12.5% |
| Distinct | 51 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 60 |
| Missing (%) | 7.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9065189048 |
| Minimum | -2.6 |
|---|---|
| Maximum | 6.2 |
| Zeros | 322 |
| Zeros (%) | 38.9% |
| Negative | 9 |
| Negative (%) | 1.1% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | -2.6 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.6 |
| Q3 | 1.6 |
| 95-th percentile | 3 |
| Maximum | 6.2 |
| Range | 8.8 |
| Interquartile range (IQR) | 1.6 |
Descriptive statistics
| Standard deviation | 1.091713032 |
|---|---|
| Coefficient of variation (CV) | 1.204291522 |
| Kurtosis | 1.072110363 |
| Mean | 0.9065189048 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 1.024891266 |
| Sum | 695.3 |
| Variance | 1.191837344 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 322 | |
| 1 | 73 | 8.8% |
| 2 | 66 | 8.0% |
| 1.5 | 45 | 5.4% |
| 3 | 27 | 3.3% |
| 2.5 | 16 | 1.9% |
| 1.4 | 15 | 1.8% |
| 0.5 | 15 | 1.8% |
| 1.6 | 14 | 1.7% |
| 0.6 | 14 | 1.7% |
| Other values (41) | 160 | |
| (Missing) | 60 | 7.3% |
| Value | Count | Frequency (%) |
| -2.6 | 1 | 0.1% |
| -1.5 | 1 | 0.1% |
| -1.1 | 1 | 0.1% |
| -1 | 1 | 0.1% |
| -0.9 | 1 | 0.1% |
| -0.8 | 1 | 0.1% |
| -0.7 | 1 | 0.1% |
| -0.5 | 1 | 0.1% |
| -0.1 | 1 | 0.1% |
| 0 | 322 |
| Value | Count | Frequency (%) |
| 6.2 | 1 | 0.1% |
| 5.6 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 4.2 | 2 | 0.2% |
| 4 | 7 | |
| 3.8 | 1 | 0.1% |
| 3.7 | 1 | 0.1% |
| 3.6 | 4 | |
| 3.5 | 2 | 0.2% |
| 3.4 | 2 | 0.2% |
| Distinct | 121 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 71 |
| Missing (%) | 8.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.475 |
| Minimum | 2 |
|---|---|
| Maximum | 36 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.6 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 5.875 |
| Q1 | 9.775 |
| median | 13 |
| Q3 | 17 |
| 95-th percentile | 23 |
| Maximum | 36 |
| Range | 34 |
| Interquartile range (IQR) | 7.225 |
Descriptive statistics
| Standard deviation | 5.408822508 |
|---|---|
| Coefficient of variation (CV) | 0.4013968466 |
| Kurtosis | 0.1574806757 |
| Mean | 13.475 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.5151015957 |
| Sum | 10187.1 |
| Variance | 29.25536093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 49 | 5.9% |
| 20 | 46 | 5.6% |
| 10 | 41 | 5.0% |
| 15 | 40 | 4.8% |
| 13 | 37 | 4.5% |
| 12 | 35 | 4.2% |
| 9 | 35 | 4.2% |
| 17 | 34 | 4.1% |
| 14 | 29 | 3.5% |
| 18 | 29 | 3.5% |
| Other values (111) | 381 | |
| (Missing) | 71 | 8.6% |
| Value | Count | Frequency (%) |
| 2 | 1 | 0.1% |
| 2.4 | 3 | 0.4% |
| 2.8 | 1 | 0.1% |
| 3 | 4 | |
| 3.5 | 1 | 0.1% |
| 4 | 9 | |
| 4.1 | 1 | 0.1% |
| 4.5 | 1 | 0.1% |
| 4.7 | 1 | 0.1% |
| 4.8 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 36 | 1 | 0.1% |
| 30 | 3 | 0.4% |
| 29 | 2 | 0.2% |
| 28 | 2 | 0.2% |
| 27 | 5 | |
| 26 | 4 | |
| 25.3 | 1 | 0.1% |
| 25.2 | 1 | 0.1% |
| 25 | 8 | |
| 24 | 7 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 2 |
| Missing (%) | 0.2% |
| Memory size | 6.6 KiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | |
| 3.0 | |
| 4.0 | 39 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2475 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 3.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 403 | |
| 1.0 | 168 | |
| 2.0 | 108 | 13.1% |
| 3.0 | 107 | 12.9% |
| 4.0 | 39 | 4.7% |
| (Missing) | 2 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 403 | |
| 1.0 | 168 | |
| 2.0 | 108 | 13.1% |
| 3.0 | 107 | 13.0% |
| 4.0 | 39 | 4.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1228 | |
| . | 825 | |
| 1 | 168 | 6.8% |
| 2 | 108 | 4.4% |
| 3 | 107 | 4.3% |
| 4 | 39 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1650 | |
| Other Punctuation | 825 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1228 | |
| 1 | 168 | 10.2% |
| 2 | 108 | 6.5% |
| 3 | 107 | 6.5% |
| 4 | 39 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2475 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1228 | |
| . | 825 | |
| 1 | 168 | 6.8% |
| 2 | 108 | 4.4% |
| 3 | 107 | 4.3% |
| 4 | 39 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2475 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1228 | |
| . | 825 | |
| 1 | 168 | 6.8% |
| 2 | 108 | 4.4% |
| 3 | 107 | 4.3% |
| 4 | 39 | 1.6% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 KiB |
| hungarian | |
|---|---|
| cleveland | |
| long-beach-va | |
| switzerland |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 10.08827086 |
| Min length | 9 |
Characters and Unicode
| Total characters | 8343 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | hungarian |
|---|---|
| 2nd row | hungarian |
| 3rd row | hungarian |
| 4th row | hungarian |
| 5th row | hungarian |
Common Values
| Value | Count | Frequency (%) |
| hungarian | 295 | |
| cleveland | 282 | |
| long-beach-va | 200 | |
| switzerland | 50 | 6.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| hungarian | 295 | |
| cleveland | 282 | |
| long-beach-va | 200 | |
| switzerland | 50 | 6.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1322 | |
| n | 1122 | |
| e | 814 | |
| l | 814 | |
| h | 495 | 5.9% |
| g | 495 | 5.9% |
| c | 482 | 5.8% |
| v | 482 | 5.8% |
| - | 400 | 4.8% |
| r | 345 | 4.1% |
| Other values (9) | 1572 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7943 | |
| Dash Punctuation | 400 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1322 | |
| n | 1122 | |
| e | 814 | |
| l | 814 | |
| h | 495 | 6.2% |
| g | 495 | 6.2% |
| c | 482 | 6.1% |
| v | 482 | 6.1% |
| r | 345 | 4.3% |
| i | 345 | 4.3% |
| Other values (8) | 1227 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7943 | |
| Common | 400 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1322 | |
| n | 1122 | |
| e | 814 | |
| l | 814 | |
| h | 495 | 6.2% |
| g | 495 | 6.2% |
| c | 482 | 6.1% |
| v | 482 | 6.1% |
| r | 345 | 4.3% |
| i | 345 | 4.3% |
| Other values (8) | 1227 |
Common
| Value | Count | Frequency (%) |
| - | 400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1322 | |
| n | 1122 | |
| e | 814 | |
| l | 814 | |
| h | 495 | 5.9% |
| g | 495 | 5.9% |
| c | 482 | 5.8% |
| v | 482 | 5.8% |
| - | 400 | 4.8% |
| r | 345 | 4.1% |
| Other values (9) | 1572 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | age | sex | cp | trestbps | htn | fbs | restecg | pro | met | thalach | thalrest | tpeakbps | tpeakbpd | trestbpd | exang | oldpeak | rldv5e | num | dataset | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 40.0 | 1.0 | 2.0 | 140.0 | 0.0 | 0.0 | 0.0 | 0.0 | 7.0 | 172.0 | 86.0 | 200.0 | 110.0 | 86.0 | 0.0 | 0.0 | 20.0 | 0.0 | hungarian |
| 1 | 1 | 49.0 | 0.0 | 3.0 | 160.0 | 1.0 | 0.0 | 0.0 | 0.0 | 7.0 | 156.0 | 100.0 | 220.0 | 106.0 | 90.0 | 0.0 | 1.0 | 13.0 | 1.0 | hungarian |
| 2 | 2 | 37.0 | 1.0 | 2.0 | 130.0 | 0.0 | 0.0 | 1.0 | 0.0 | 5.0 | 98.0 | 58.0 | 180.0 | 100.0 | 80.0 | 0.0 | 0.0 | 14.0 | 0.0 | hungarian |
| 3 | 3 | 48.0 | 0.0 | 4.0 | 138.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.0 | 108.0 | 54.0 | 210.0 | 106.0 | 86.0 | 1.0 | 1.5 | 22.0 | 3.0 | hungarian |
| 4 | 4 | 54.0 | 1.0 | 3.0 | 150.0 | 0.0 | 0.0 | 0.0 | 1.0 | 3.0 | 122.0 | 74.0 | 130.0 | 100.0 | 90.0 | 0.0 | 0.0 | 9.0 | 0.0 | hungarian |
| 5 | 5 | 39.0 | 1.0 | 3.0 | 120.0 | 0.0 | 0.0 | 0.0 | 0.0 | 8.0 | 170.0 | 86.0 | 198.0 | 100.0 | 80.0 | 0.0 | 0.0 | 21.0 | 0.0 | hungarian |
| 6 | 6 | 45.0 | 0.0 | 2.0 | 130.0 | 0.0 | 0.0 | 0.0 | 0.0 | 10.0 | 170.0 | 90.0 | 200.0 | 106.0 | 84.0 | 0.0 | 0.0 | 11.0 | 0.0 | hungarian |
| 7 | 7 | 54.0 | 1.0 | 2.0 | 110.0 | 0.0 | 0.0 | 0.0 | 0.0 | 7.0 | 142.0 | 56.0 | 220.0 | 70.0 | 70.0 | 0.0 | 0.0 | 11.0 | 0.0 | hungarian |
| 8 | 8 | 37.0 | 1.0 | 4.0 | 140.0 | 1.0 | 0.0 | 0.0 | 0.0 | 7.0 | 130.0 | 63.0 | 190.0 | 100.0 | 80.0 | 1.0 | 1.5 | 19.0 | 1.0 | hungarian |
| 9 | 9 | 48.0 | 0.0 | 2.0 | 120.0 | 0.0 | 0.0 | 0.0 | 0.0 | 4.0 | 120.0 | 72.0 | 140.0 | 80.0 | 80.0 | 0.0 | 0.0 | 6.0 | 0.0 | hungarian |
Last rows
| df_index | age | sex | cp | trestbps | htn | fbs | restecg | pro | met | thalach | thalrest | tpeakbps | tpeakbpd | trestbpd | exang | oldpeak | rldv5e | num | dataset | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 817 | 891 | 62.0 | 1.0 | 4.0 | 160.0 | 1.0 | 1.0 | 1.0 | 1.0 | 2.5 | 108.0 | 69.0 | 160.0 | 90.0 | 80.0 | 1.0 | 3.0 | 19.0 | 4.0 | long-beach-va |
| 818 | 892 | 53.0 | 1.0 | 4.0 | 144.0 | 1.0 | 1.0 | 1.0 | 0.0 | 5.0 | 128.0 | 76.0 | 150.0 | 102.0 | 94.0 | 1.0 | 1.5 | 13.0 | 3.0 | long-beach-va |
| 819 | 893 | 62.0 | 1.0 | 4.0 | 158.0 | 1.0 | 0.0 | 1.0 | 0.0 | 8.0 | 138.0 | 86.0 | 202.0 | 98.0 | 90.0 | 1.0 | 0.0 | 22.0 | 1.0 | long-beach-va |
| 820 | 894 | 46.0 | 1.0 | 4.0 | 134.0 | 1.0 | 0.0 | 0.0 | 0.0 | 7.0 | 126.0 | 88.0 | 174.0 | 114.0 | 90.0 | 0.0 | 0.0 | 7.0 | 2.0 | long-beach-va |
| 821 | 895 | 54.0 | 0.0 | 4.0 | 127.0 | 0.0 | 1.0 | 1.0 | 0.0 | 8.0 | 154.0 | 83.0 | 158.0 | 84.0 | 78.0 | 0.0 | 0.0 | 20.0 | 1.0 | long-beach-va |
| 822 | 896 | 62.0 | 1.0 | 1.0 | NaN | 0.0 | 0.0 | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | long-beach-va |
| 823 | 897 | 55.0 | 1.0 | 4.0 | 122.0 | 1.0 | 1.0 | 1.0 | 0.0 | 5.0 | 100.0 | 74.0 | 210.0 | 100.0 | 70.0 | 0.0 | 0.0 | 4.0 | 2.0 | long-beach-va |
| 824 | 898 | 58.0 | 1.0 | 4.0 | NaN | 0.0 | 1.0 | 2.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 | long-beach-va |
| 825 | 899 | 62.0 | 1.0 | 2.0 | 120.0 | 1.0 | 0.0 | 2.0 | 0.0 | 7.0 | 93.0 | 67.0 | 164.0 | 110.0 | 80.0 | 1.0 | 0.0 | 17.0 | 1.0 | long-beach-va |
| 826 | 900 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | long-beach-va |